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ABSTRACT 

The recent coronavirus disease (COVID-19) is extending very speedily over 
the world for the sake of its very infectious nature and is announced 
nationwide by the world health organization (WHO). The (COVID-19) is a 
group of coronavirus that has caused panic all over the world. It enters 
people through the sneezing and coughing of the infected person and 
weakens the person and it then slowly infects the affected person’s lungs. 
In this study, we have classified the chest X-Ray images like Covid-19 
infected chest images or normal chest images. Classifying the chest X-Ray 
images is hard and time-consuming work for human beings. Hence, an 
automatic Covid-19 infected chest X-Ray image or normal chest 
classification tool is very useful even for experience humans to classify a lot 
of chest X-Ray images. For that, we have proposed a new machine learning 
technique to automatically classify the chest Covid-19 infected X-Ray 
images or normal chest images. Hence, we have used a Machine learning 
(ML) model like Support Vector Machine (SVM) to classify (Covid-19) 
infected chest images and normal chest images. For this work, at first, we 
have preprocessed the chest X-Ray image. Then we have extracted the 
distinct features from the chest X-Ray images. After that, these features 
have trained into Machine Learning (ML) algorithm and finally classify 
these images into the category. From the experiment, The Support Vector 
Machine (SVM) models achieving an accuracy of up to 93.1%. 
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1. INTRODUCTION 

Covid-19 has become a burning issue in today's world. 
This virus has created a difficult situation all over the 
world. The outbreak began in late December 2019 in Wan, 
Hubei Province, China, and has since spread to 218 regions 
around the world, including Bangladesh. On March 8, 
2020, the first case of the virus was identified in the 
country and ten days later, on March 18, the first person 
died of the virus, then the rate of infection gradually 
increased in Bangladesh[1].In the last two to three months 
of last year, the corona infestation and mortality graph in 
Bangladesh was very low. But in February and the end of 
March this year, there was a sudden increase in corona 
infestation and mortality at a time when people were not 
as alert. Due to which the number of patients infected with 
corona is gradually increasing in the big cities of 
Bangladesh. According to the Department of Health, as of 
May 11, 2021, the number of patients suffering from 
compassion in Bangladesh is over 7 lakh 76 thousand and 
the number of deaths due to compassion is 12005. The 
human body suffers from various types of damage when it 
is infected with the Covid-19, most notably the human 
airways and lungs. 


Health experts have already said that these people have 
been infected with the Covid-19, their lungs have been 
largely destroyed and they will never fully recover. 


the terms of the Creative Commons 
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Covid-19 lungs are severely damaged and the most 
common symptoms are severe shortness of breath, cough, 
and fatigue. An X-ray of an infected person's chest reveals 
the condition of the person's lungs and how much damage 
has been done. Over the last year, there has been much 
research on Covid-19 chest X-Ray images in the world 
among the research includes X-Ray image classification, 
lung image, tumor classification, blood cell detection, etc. 
There is currently a lot of research being done on 
coronavirus infected lung images using machine learning 
algorithms. In recent years, image processing plays an 
important role in the part of machine learning[2]. Image 
processing (IP) means fetching necessary knowledge from 
the image. The X-Ray image classification task much like 
general image classification like a cat, dog, cow, etc[3]. 
Presently, X-Ray chest image classification is a significant 
thing to identify lung infection type or situation of the lung 
and detects their levels. 


In this work, a novel technique is illustrated which is the 
organization of one classification model, Support Vector 
Machine (SVM) with various distinct sets of features[4]. 
The features learning are average red, green, blue color, 
hue, saturation, values color, and horizontal and vertical 
contrast, horizontal and vertical correlation, horizontal 
and vertical energy, horizontal and vertical homogeneity, 
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gray-level co-occurrence matrix (GLCM), and compost 
automatic various features for SVM algorithm and their 
convention. These features have complied to learn the 
machine learning (ML) models for classifying the X-Ray 
chest image. 


In our research work, our image dataset contained 140 X- 
Ray images that are categorized into two classes of chests 
namely normal chest and covid-19 infected chest images. 
From the experimental result, we have investigated that 
the SVM algorithm classifies the normal chest image and 
Covid-19 chest image. From our observation, the Support 
Vector Machine (SVM) model illustrates the accuracy is 
(93.9%). 


The rest of this paper is as follows. Section 2 describes the 
literature review. Section 3 represents the dataset and 
methodology. Section 4 depicts the results Section 5 
displays the conclusion. 


Z. Literature Review 

Saurabh Kumar et al. in [5] tried to classify chest X-Ray 
images using deep learning. They have scaled all the 
images of the data set to a uniform size of 512x512. Here 
they have used 401 images for these classification 
problems. This dataset contains 401 images out of which 
262 images are covid-19 negative and the rest of these 
images are covid-19 positive and they have achieved a 
high accuracy result. For a good result, the deep learning 
model required huge image data but they have used only 
401 images sample for this classification problem. So it 
was the limitation of their research work. 


The authors in [6] described medical imaging such as X- 
ray and Computed Tomography (CT) associated with the 
potential of Artificial Intelligence (AI) plays a vital role in 
siding the medical staff in the diagnosis process. These 
types of image classification they have used five deep 
learning algorithm namely (ResNet18, ResNet34, 
InceptionV3, InceptionResNetV2, and DenseNet161). In 
this study, they used two public datasets and the first 
dataset was the COVID-19 image data _ collection, 
consisting of 236 images of COVID-19, 12 images of 
COVID-19 and ARDS, 4 images of ARDS, 1 image of 
Chlamydophila, 1 image of Klebsiella, 2 images of 
Legionella, 12 images of Pneumocystis, 16 images of SARS, 
13 images of Streptococcus and 5 images without any 
pathological findings and the second dataset was covid-19 
X-Ray images. Here, they have used a deep learning model 
for classifying the covid-19 positive or negative case. The 
performance of the Deep Learning Algorithm depends on 
its data set which means the amount of data needed for 





Training Image 











good results but here they have used very few images for 
their work so the algorithm may provide bias results. 


Shelke et al. in [7]were described the classification model 
to classify the X-Ray image. Here, they have used four 
classes of X-Ray images namely normal, pneumonia, 
tuberculosis (TB), and COVID-19. They have used a deep 
learning model used for the classification of pneumonia, 
TB, and normal is VGG16 with an accuracy of 95.9 %. But 
they don't explain how many images they've used for their 
research that is the drawback of their work. 


Another CXR image classification task was described 
here[8]. Here, the authors have used the deep Residual 
Network (RESNET-50). In this classification, they have 
used four types of CXR image cases like healthy 
individuals, bacterial and viral pneumonia, and COVID-19 
positives patients. The model performance metrics 
showed an accuracy of 99%. But a large amount of data 
are required for the deep learning algorithm for high 
accuracy but a small number of images were used for 
these purposes which is the limitation of their research. 


The authors in [9] were described the rapid development 
in the area of Machine Learning (ML) and Deep Learning. 
Here, they have proposed intelligent systems to classify 
between Pneumonia and Normal patients. The proposes of 
the machine learning-based classification of the extracted 
deep feature using ResNet152 with COVID-19 and 
Pneumonia patients on chest X-ray images. SMOTE was 
used for balancing the imbalanced data points of COVID- 
19 and Normal patients. The model has achieved accuracy 
up to .97% on Random Forest and 97.4%n using XGBoost 
predictive classifiers. 


3. Methodology and Dataset 

In this part, the algorithm and datasets are described. The 
algorithm is used to classify covid-19 infected and normal 
chest X-Ray images. The algorithm includes SVM (Support 
Vector Machine). The chest X-Ray images of two levels of 
normal and covid-19 are provided as the dataset in the 
algorithm. 


3.1. Proposed System 

The proposed system block diagram is shown here. The 
block diagram is shown in Fig.3.1. The training images are 
resized, and many pre-processing are executed. So that the 
original images may carry many noise and errors. Then 
the contrast of the images is increased. Many features are 
extracted and then the machine learning model is applied 
to classify the images. When the training is finished, the 
accuracy of the model is calculated using the test images 
and confusion matrix. 
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Fig. 3.1: Proposed System Block Diagram 
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The algorithm of the proposed system is illustrated in Algorithm 1. This algorithm represents the steps of the proposed 
model. 

Algorithm 1: Covid-19 chest X-Ray image classification 
Read training images from the dataset 

Pre-process images 

Train the model SVM to classify 

Read test image 

Apply the trained model to classify the test image 
Control the classification_error 

If (False_result> threshold) 

Jump to step 1. 

Else 

achieve accuracy. 


NO UB WN RP 


3.2. Dataset Description 
In this work, we have collected many chest X-Ray images from different Medical colleges and private Hospitalsof the 
Mymensingh and Tangail districts in Bangladesh. 





Cowvid-19 infected Chest Mormal Chest 
Fig 3.2: Two levels Chest X-Ray Images 


These chest X-Ray images having two levels were taken from different Medical colleges and private Hospitals of the 
Mymensingh and Tangail, Dhaka, Natore districts in Bangladesh. The chest X-Ray images are then categorized into two 
classes based on two levels of covid-19 infected and normal chest depicted in the figure. 3.2. Each class has more than 
seventy images. Details of each class are shown in Table 3.1. These images are divided randomly into training (70%) and 
testing (30%) images. 


Table 3.1: Details of Image Dataset 






Class Name Label No. of image 


Covid-19 Infected image Lac” 75 


Total Images 154 


3.3. Image Pre-processing 

Training images are collected from different places may be of various resolution and with noise. Therefore, image pre- 
processing is needed for reshaping the images and avoiding the noise. We have used Matlabresize () function to resize the 
pictures to 350*180 pixels, filter () function for smoothing, sharpening. Here, we have also used the gray2rgb () function 
for eliminating the hue and saturation information, gray_comatrix () function for using texture analysis of the images. 


Tab 3.2: Image Preprocessing Function 
ML Model Description Use the Matlab Functions 


Resize to 350 x 180 pixels 
Filtering for smoothing, sharpening filter Q) 


The gray2rgb function for eliminating the hue and aie) 
saturation information of the i Meciata Aclin aesaseen eae —— a oey 


Graycomatrix function for texture } Graycomatrix function for texture analysis of the images. of the images. graycomatrix | graycomatrix() 


3.4. Feature Extraction 
Feature learning is the next processing step in image analysis. It can be used for images. Feature extraction is the 
measurable character of an image or object. 
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Tab 3.3: Feature Extraction Function of machine learning model 


ML ste WEE 
Model Feature Name Description ere Eee 


The average value of all red pixels in the chest Rave=uint8 
Average red color , 

X-Ray image surface (mean ()) 

The average value of all green pixels in the chest Gave=uint8 
Average green color 

X-Ray image surface mean 

The average value of all blue pixels in the chest Bave=uint8 
Average blue color 

X-Ray image surface (mean ()) 

The average value of all hue pixels in HSV the Have=uint8 
Average hue color , 

SVM chest X-Ray image surface mean 


Aacacaeaturanewcolor The average value of all saturation pixels in HSV Save=uint8 
6 the chest X-Ray image surface (mean ()) 
The average value of all values pixels in HSV Save=uint8& 
Average values color ; 
chest X-Ray image surface mean 
arenes ecical Contras Find the local variation of the gray-level co- 
occurrence matrix 
Horizontal and Vertical Correlation nee eae prose eccunreneObEse Correlation () 
specified pixel pairs. 


3.5. Chest X-Ray Image Classification or prediction 

It classifies the target chest X-Ray class into a predefined input image using a machine learning algorithm. After the 
finishing of the feature extraction process, the images are first trained through the machine learning SVM (Support Vector 
Machine) model, and when the finishes the training process, we have examined with a single image that is not training by 
the trained classifier. The testing classifier compares the new testing image with previously trained different image levels. 
When the testing image is equalized the levels train images the machine learning model classifies the target chest X-Ray 
image levels. 





3.6. Machine Learning Models 

Machine learning (ML) is the data analytical design that instructs computers to do what comes naturally to human study 
from knowledge. Machine learning (ML) models use the computational process to learn facts immediately from data 
without depending on a predetermined equation as an algorithm[10].The models adaptively progress their achievement 
as the number of samples gain able for studying increases. In this work, we have to study Support Vector Machine (SVM) 
machine learning algorithms to complete the automatic covid-19 infected chest image classification work. 


3.7. Support Vector Machine (SVM) Model 

Support vector machine is a supervised Machine Learning (ML) algorithm. It can be used for both classification and 
regression problems[11]. But it is popularly used for classification. The primary goal of the Support Vector Machine (SVM) 
is to find the hyper plane which divides the two classes of data. 


fC OPti mal Hyperplane 
Normal X-Ray 


Image 
oa" @ Covid-19 infected X-Ray 
Be jhe @& @ Image 
te | cd Class 
a ®e®,e 
te ® » 
Be | & 


ll di,” 


Support Vectors 
Fig.3.3: Internal Structure of Support Vector Machine (SVM) 


In most cases, a support vector machine is used to classify the data. A hyper plane which partitions two class with 
maximum distance is called an optimal hyper plane. SVM is very skillful for the supervised classifier. In this work, the 
Support Vector Machine (SVM) algorithm is used for two types of chest X-Ray image classification shown in (Fig.3.3). An 
SVM model is a representation of the examples as points in the coordinate system, mapped so that the sample of the two 
classes are divided by a clear gap that is as wide as possible. Given a training set of two classes, G = { (xi, Yi), i= 1... N} with 
a hyperplane W’ ¢(x,;) + b = 0, xi € Rn and y e€ {1, -1}, the support vector machine satisfies the following conditions: 


(W*$(x;) +b) = Lif yi=1, (3.5) 
(W* p(x) + b) S$ -1,if yi = -1, (3.6) 
Or equivalently, 
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Where @ is the function that maps training vector xi to the higher dimensional space when the data points are linearly 
separable. The distance from a point xi to the hyper plane is: 
1(W! b(xj)+b)I 

||w||? 


From the definition of SVM, the margin is 


(3.8) 


—_ Hence, the equation of hyper plane is (Zl) = - [|W ||? 


|W 
(3.9) 
According to the saddle point of the Lagrange function, the solution of the above equation is, 
1 
by: = 5|Iwll? — Dia aiLyi(w7 Ce) + b) - 1] (3.10) 


where are the nonnegative Lang range multipliers. When the data is not separable, a new slack variable is 
introduced and the optimization equation is 


yi(W" p(x) + b) 21-§; (3.11) 

And the hyper plane equation is- (2, Bl) = - IlwI|* +¢U7, & (3.12) 

where C is a positive constant parameter used as a penalty parameter for the error term. If the optimization of the support 
vector machine uses linear and radial basis function, then the equation is: K(x,;,x;) = x =X 


K(x%;,x;) = expexp (—y|Ix, — xl"). >0 (3.13) 
Where y is the kernel parameter. Figure. 3.3 represents the visualization of SVM. The main advantage of the support vector 
machine is that it is effective in high dimensional spaces and it also works well with a clear margin of separation. The 
primary drawback of the support vector machine is that it does not well perform when the data set is large. The support 
vector machine also low performance, the data set, is rowdy. 


4. Result Analysis 

In this part, we have discussed the classification performance of the support vector machine for these two classes chest X- 
Ray image prediction task. The Confusion Matrix (CM) is a table that mostly behaves to consider the achievement of a 
classification model on a Set of testing data for which the true values are acquainted[12]. The Confusion Matrix (CM) itself 
is comparatively easy to realize for that many researchers use it. 


Average Accuracy of 5VM 


loved, & || Wer cheech Pots Sl 


1 


Aon al fe Sl 


Output Class 








Target Clas 
Fig. 4.1: Confusion Matrix of Support Vector Machine 


In (Fig.4.1) we see that diagonally shaded boxes display the percent accuracy result of the SVM model. On the other side 
shaded box illustrated the percent of mistakes for the classification problem. The average accuracy of the support vector 
machine classifier is depicted for the classification problem is 93.1% that means the average achievement accuracy for the 
overall classifier with the best value is (93.1%). In common mistake for the overall classifier with the value is (6.9%). In 
this study, we see that’s the covid-19_infected_chesthas displayed the highest and theNormal_chest illustrated the lowest 
classification accuracy respectively. 


5. Conclusion 
In the study, we have proposed a novel technic to classify 
Chest X-Ray images with two-level using the Machine 


classification. Image Pre-processing means resizing the 
image, noise avoiding. Then, we have extracted distinct 
features like RGB color, HSV color, horizontal and vertical 


Learning (ML) model. The raised system is used machine 
learning models to automatically classify the Covid-19 
infected chest X-Ray image and Normal chest X-Ray image. 
Our proposed system includes three phases: Image pre- 
processing features learning or_ extraction, and 


contrast, etc are extracted. In fine, the classification part 
can be performed after the feature vectors are propagated 
for every image. The Support vector machine (SVM) 
classifier is used for the classification problem. Our 
proposed system has been performed evaluated using 
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154Chest X-Ray images. From among these images, we 
have used 70 % images for training and 30% images for 
testing randomly. From the experimental result, we have 
achieved accuracy up to 93.1%. 
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